Openvla 7b Oft Finetuned Libero Spatial
MIT
OpenVLA-OFT is an optimized vision-language-action model that significantly improves the running speed and task success rate of the basic OpenVLA model through fine-tuning technology.
Multimodal Fusion
Transformers